The use of typical sequences for robust speaker identification
Abstract
Speech data are affected by accidental structures such as spurious events and artifacts (breath and mouth noises, lip clicks, etc.). Because the whole utterance is used during both training and identification, these factors may form confusable acoustic classes that do not contribute to the performance of the speaker recognition system. In this paper we propose a new approach that extracts as much essential information as possible from the acoustic data in order to estimate more robust speaker models. To this end, we use the Asymptotic Equipartition Property (AEP) from information theory to generate a reduced feature subspace termed the "typical set". Results on the Spidre corpus show that the method yields an appreciable improvement (a 10% gain in performance on average) over the baseline system.
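The idea of a typical set can be illustrated with a minimal sketch. The abstract does not give the authors' model or thresholds, so everything here is an assumption made for illustration: a single diagonal Gaussian is fitted to the feature frames, and a frame is kept when its negative log-likelihood lies within a hypothetical tolerance `epsilon` (in nats) of the model's differential entropy, mirroring the AEP condition. The function name `typical_frames` is likewise hypothetical.

```python
import numpy as np


def typical_frames(features, epsilon=3.0):
    """Return a boolean mask marking frames that fall in an AEP-style typical set.

    Assumption (not from the paper): frames are modeled by one diagonal
    Gaussian fitted to `features`, an array of shape (n_frames, n_dims).
    """
    mean = features.mean(axis=0)
    var = features.var(axis=0) + 1e-8  # small floor avoids division by zero

    # Per-frame negative log-likelihood (surprisal) in nats.
    nll = 0.5 * np.sum(
        np.log(2.0 * np.pi * var) + (features - mean) ** 2 / var, axis=1
    )

    # Differential entropy of the fitted diagonal Gaussian, in nats.
    entropy = 0.5 * np.sum(np.log(2.0 * np.pi * np.e * var))

    # A frame is "typical" when its surprisal is close to the entropy.
    return np.abs(nll - entropy) <= epsilon


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    speech = rng.normal(size=(500, 12))   # stand-in for cepstral frames
    artifacts = np.full((5, 12), 8.0)     # stand-in for clicks or breath noise
    frames = np.vstack([speech, artifacts])
    mask = typical_frames(frames)
    print(f"kept {mask.sum()} of {len(frames)} frames")
```

Only the frames selected by the mask would then be passed to speaker-model training. In practice the tolerance would have to be tuned, and a richer density model (for example a Gaussian mixture) could replace the single Gaussian assumed here.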
Similar resources
Combination of temporal trajectory filtering and projection measure for robust speaker identification
This paper presents a method that combines the techniques of temporal trajectory filtering and projection measure for robust speaker identification. The proposed robust feature, called Relative Autocorrelation Sequence Mel-scale Frequency Cepstral Coefficients (RAS-MFCC), is derived based on filtering the temporal trajectories of short-time one-sided autocorrelation sequences. This filtering pr...
Using Exciting and Spectral Envelope Information and Matrix Quantization for Improvement of the Speaker Verification Systems
Speaker verification from a few spoken words or sentences has many applications. Many methods, such as DTW, HMM, VQ and MQ, can be used for speaker verification. We applied MQ for its precise, reliable and robust performance and its computational simplicity. We also used pitch frequency and log gain contour to further improve system performance.
Robust portfolio selection with polyhedral ambiguous inputs
Ambiguity in model inputs is typical, especially in the portfolio selection problem, where the true distribution of the random variables is usually unknown. Here we use a robust optimization approach to address the ambiguity in the conditional-value-at-risk minimization model. We obtain explicit models of the robust conditional-value-at-risk minimization for polyhedral and correlated polyhedral am...
Likelihood Ratio Based Score Fusion for Audio-Visual Speaker Identification in Challenging Environment
Visual speech information combined with audio utterances is well known to enhance the performance of noise-robust speaker identification. This paper presents an approach to evaluate the performance of a noise-robust audio-visual speaker identification system using likelihood ratio based score fusion in a challenging environment. Though the traditional HMM based audio-visual speaker identification...